Alexander Sturn
نویسندگان
چکیده
Cluster Analysis for Large Scale Gene Expression Studies High throughput gene expression analysis is becoming more and more important in many areas of biomedical research. cDNA microarray technology is one very promising approach for high throughput analysis and provides the opportunity to study gene expression patterns on a genomic scale. Thousands or even tens of thousands of genes can be spotted on a microscope slide and relative expression levels of each gene can be determined by measuring the fluorescence intensity of labeled mRNA hybridized to the arrays. Beyond simple discrimination of differentially expressed genes, functional annotation (guilt-by-association) or diagnostic classification requires the clustering of genes from multiple experiments into groups with similar expression patterns. A platform independent Java package of tools has been developed to simultaneously visualize and analyze a whole set of gene expression experiments. After reading the data from flat files several graphical representations of hybridizations can be generated, showing a matrix of experiments and genes, where multiple experiments and genes can be easily compared with each other. Fluorescence ratios can be normalized in several ways to gain a best possible representation of the data for further statistical analysis. Hierarchical and non hierarchical algorithms have been implemented to identify similar expressed genes and expression patterns, including: 1) hierarchical clustering, 2) k-means, 3) self organizing maps, 4) principal component analysis, and 5) support vector machines. More than 10 different kinds of similarity distance measurements have been facilitated, ranging from simple Pearson correlation to more sophisticated approaches like mutual information. Moreover, it is possible to map gene expression data onto chromosomal sequences. The flexibility, variety of analysis tools and data visualizations makes this software suite a valuable tool in future functional genomic studies.
منابع مشابه
Client-Server environment for high-performance gene expression data analysis
SUMMARY We have developed a platform independent, flexible and scalable Java environment for high-performance large-scale gene expression data analysis, which integrates various computational intensive hierarchical and non-hierarchical clustering algorithms. The environment includes a powerful client for data preparation and results visualization, an application server for computation and an ad...
متن کاملCARMAweb: comprehensive R- and bioconductor-based web service for microarray data analysis
CARMAweb (Comprehensive R-based Microarray Analysis web service) is a web application designed for the analysis of microarray data. CARMAweb performs data preprocessing (background correction, quality control and normalization), detection of differentially expressed genes, cluster analysis, dimension reduction and visualization, classification, and Gene Ontology-term analysis. This web applicat...
متن کاملGenesis: cluster analysis of microarray data
A versatile, platform independent and easy to use Java suite for large-scale gene expression analysis was developed. Genesis integrates various tools for microarray data analysis such as filters, normalization and visualization tools, distance measures as well as common clustering algorithms including hierarchical clustering, self-organizing maps, k-means, principal component analysis, and supp...
متن کاملPsychiatric onset Alexander disease: an important challenge in neuropsychiatric diagnosis
Introduction: Alexander disease is a heterogenous group of diseases with various manifestations based on age of disease onset. This rare leukodystrophy syndrome with mutations in GFAP Gene could present with developmental delay and seizure in infantile form to ataxia and bulbar palsy in adulthood. However psychiatric symptoms are not well-defined and usually evaluate after disease diagnosis not...
متن کاملImage Sculpturer – From 2d Images to 3d Objects
We introduce a new application for creating 3d objects from 2d images. First we give a motivation for such a program. Then we quickly talk about already known image based modeling programs and their drawbacks. Based on this information we describe our approach and our application. The paper ends with a conclusion and some figures of 3d meshes created with the program.
متن کامل